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Abstract 

Bin packing is a well studied problem involved in many applications. The classical bin packing 
problem is about minimising the number of bins and ignores how the bins are utilised. We focus 
in this paper, on a variant of bin packing that is at the heart of efficient management of data 
centres. In this context, servers can be viewed as bins and virtual machines as items. The efficient 
management of a data-centre involves minimising energy costs while ensuring service quality. The 
assignment of virtual machines on servers and how these servers are utilised has a huge impact 
on the energy consumption. We focus on a bin packing problem where linear costs are associated 
to the use of bins to model the energy consumption. We study lower bounds based on Linear 
Programming and extend the bin packing global constraint with cost information. 
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1. Introduction 

Data centres are a critical and ubiquitous resource for providing infrastructure for banking, 
Internet and electronic commerce. They use enormous amounts of electricity, and this demand is 
expected to increase in the future. For example, a report by the EU Stand-by Initiative stated that 
in 2007 Western European data centres consumed 56 Tera-Watt Hours (TWh) of power, which 
is expected to almost double to 104 TWh per year by 2020. 1 Energy consumption is one of the 
most important sources of expense in data centres and energy efficiency is at the core of their 
competitive advantage. The ongoing increase in energy prices (a 50% increase is forecasted by the 
French senate by 2020) and the growing market for cloud computing are the main incentives for 
the design of energy efficient centres. Minimising energy consumption is not only important for 
economic reason but also for environmental reason. The consulting firm McKinsey reported that 
only 6-12% of electricity used by data centres can be attributed to the performance of productive 
computation [10]. Hence, there is a lot of scope for reducing the carbon footprint of data centres 
by utilising servers more efficiently. 
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We study a problem associated with the EnergeTIC 2 project which was accredited by the 
French government (FUI) [7] that brought together four companies (Bull, Business & Decision 
Eolas, Schneider Electric, ETXP), several academic partners (G2Elab, G-SCOP, LIG), and a startup 
company (Vesta-System). The objective is to control the energy consumption of a data center and 
ensure that it is consistent with application needs, economic constraints and service level agree¬ 
ments. We focus on how to reduce energy cost by taking cpu requirements of client applications, 
IT equipment and virtualisation techniques into account. 

The EnergeTIC optimisation problem for energy-cost aware data centre assignment systems is 
about allocating virtual machines demands to servers where the energy cost per unit of computa¬ 
tion can vary between different servers. 

The problem can be defined by a set of servers and a set of virtual applications to be run 
on those servers. Each server is associated with a set of available resources, e.g. CPU, RAM, 
DISK etc. Each virtual application is associated with a set of required resource values. The 
solution of the problem is an assignment of virtual machines to servers which respects a set of hard 
constraints. The objective is to take advantage of differences in energy costs across the servers, 
the requirements of virtual applications, and consolidate machine workload to ensure that servers 
are well utilised so that energy costs can be reduced. 

A combinatorial optimisation model for the problem of loading servers to a desired utilisation 
level has, at its core, a variant of bin packing (BP) problem [18]. In such a model each server 
is represented by a bin with a capacity equal to the amount of resource available. Bin packing 
is a very well studied NP-Hard problem. In the present context, energy consumption cannot be 
accurately modelled by only considering the number of active servers since the energy cost of a 
server is a function of the workload. Furthermore, servers require energy not only to run processes 
but also when they are idle which must be considered in the energy management process. We 
focus on an extension of bin packing problem with linear costs associated with the use of bins in 
Section 2. This is a key sub-problem of the application domain and we show how to handle it effi¬ 
ciently with Constraint Programming (CP). We study lower bounds based on Linear Programming 
and extend the bin packing global constraint with cost information. The present work has been 
partially published in [3] which focus on the real-life application. 

2. Bin Packing with Linear Usage Costs 

In this section we consider a variant of the Bin Packing problem (BP) [18], which is the key 
sub-problem of the application investigated here. We denote by S = {w [,..., w n } the integer sizes 
of the n items such that wq < w 2 < ...w,,. A bin j is characterised by an integer capacity Cj, a 
non-negative fixed cost /) and a non-negative cost cj for each unit of used capacity. We denote by 
B = {{Ci, /i, Ci),..., {C m , f n , c m }} the characteristics of the m bins. A bin is used when it contains 
at least one item. Its cost is a linear function fj + cjlj , where lj is the total size of the items in bin j. 
The total load is denoted by W - X"=i w, and the maximum capacity by C max = max\<j< m Cj. The 
problem is to assign each item to a bin subject to the capacity constraints so that we minimise the 


2 Minalogic EnergeTIC is a Global competitive cluster located in Grenoble France and fostering research-led in¬ 
novation in intelligent miniaturised products and solutions for industry. 
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sum of the costs of all bins. We refer to this problem as the Bin Packing with Usage Cost problem 
(BPUC). BP is a special case of BPUC where all fj are set to 1 and all cj to 0. The following 
example shows that a good solution for BP might not yield a good solution for BPUC. 


Example 1. In Figure 1, Scenario 1, B ={(9,0,1),(3,0,2),(3,0,2),(3,0,2),(3,0,2)} and S = {2,2,2,2,3,3,3}. 
Notice that V j,fj = 0. The packing (Pf) : {{2,2,2,2}, {5}, {5}, {5}, {}} is using the minimum number 
of bins and has a cost of 26 (8*1 + 3*2 + 3*2 + 3*2). The packing (P 2 ): {{3,3,3}, {2}, {2}, {2}, {2}} 
is using one more bin but has a cost of 25 (9 + 2*2 + 2*2 + 2*2 + 2*2). Here, (P 2 ) is better than 
(Pi) and using the minimum number of bins is not a good strategy. Now change the last unit cost 
to C 5 = 3 (see Figure 1, Scenario 2). The cost of (Pi) remains unchanged since it does not use bin 
number 5 but the cost of (P 2 ) increases to 27, and thus (Pi) is now better than (P 2 ). 



Scl: {9,0,1} {3,0.2} {3,0,2} {3,0,2} {3,0,2} 

Sc2: {9,0,1} {3,0,2} {3,0,2} {3,0,2} {3,0,3} 


Figure 1: Example of optimal solutions in two scenarios of costs. In Scenario 1, the best solution has no waste on the 
cheapest bin. In Scenario 2, it does not fill completely the cheapest bin. 

Literature Review. A first relevant extension of BP for the current paper is called Variable Size 
Bin-Packing, where bins have different capacities and the problem is to minimise the sum of the 
wasted space over all used bins [13]. It can be seen as a special case of BPUC where all fj = Cj 
and Cj = 0. Recent lower bounds and an exact approach are examined in [11]. A generalisation 
to any kind of fixed cost is presented in [4], which can be seen as a special case of BPUC where 
all Cj = 0. Concave costs of bin utilisation studied in [12] are more general than the linear cost 
functions of BPUC. However [12] does not consider bins of different capacities and deals with 
the performance of classical BP heuristics whereas we are focusing on lower bounds and exact 
algorithms. Secondly, BP with general cost structures have been introduced in [1] and studied 
in [8]. The authors investigated BP with non-decreasing and concave cost functions of the number 
of items put in a bin. They extend it with profitable optional items in [2]. Their framework can 
model a fixed cost but does not relate to bin usage. 

3. Basic Formulation and Lower Bounds of BPUC 

Numerous linear programming models have been proposed for BP [6]. We first present a 
formulation for BPUC. For each bin a binary variable y\j is set to 1 if bin j is used in the packing, 
and 0 otherwise. For each item i e 11,...,«} and each bin j e [1,..., m] a binary variable Xjj is set 
to 1 if item i is packed into bin j, and 0 otherwise. We add non-negative variables 1/ representing 
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the load of each bin j. The model is as follows: 


Minimize Zi = + cfij) 

(1.1) l l %ix ij = l, 

(1.2) Z'U w i x ij = l Jj 

(1.3) h < Cj}'j, 

(1.4) Xij € {0, l},y 7 € {0,1}, lj > 0 


Vi € {1,... ,n} 

Vj 6 {!,... ,m} 

Vj 6 {1.m} 

Vj 6 {!,...,m},Vi 6 {!,...,n} 


( 1 ) 


Constraint (1.1) states that each item is assigned to one bin whereas (1.2) and (1.3) enforce the 
capacity of the bins. We now characterise the linear relaxation of the model. Let r 7 = fj/Cj + Cj be 
a real number associated with bin j. If bin j is filled completely, r, is the cost of one unit of space 
in bin j. We sort the bins by non-decreasing ry. r ai < r a2 < ... < r„ m ; ct\,..., a m is a permutation 
of the bin indices 1,..., m. Let k be the minimum number of bins such that C Uj > W. 

Proposition 1. Let z\ be the optimal value of the linear relaxation of the formulation (1). We have 
Z\ > Lb ! with Lb ! = c aj r aj +(W- Z k ~\ C aj )r ak . 

Proof 1. z\ - 27=i (//>’./ + c jlj) - Z%i(frp- + c jlj) because of the constraint lj < Cyyj, so Zj > 

f. 1 

Tj"j=\(c~ +C j)lj — 27= 1 OO- bb\ is the quantity minimising YIJ=i r jh under the constraints Yjj lj = W 
where each f < Cj. To minimise the quantity we must split W over the f related to the smallest 
coefficients rj. Hence, z\ > YIJ=\ r jh ^ Lb 1 . □ 


Lb\ is a lower bound of BPUC that can be computed in Oimlogim ) + n). Also notice that Lb\ is 
the bound that we get by solving the linear relaxation of formulation (1). 

Proposition 2. Lh\ is the optimal value of the linear relaxation of the formulation (1). 

Proof 2. For all j < k, we set each y aj to 1 and l Uj to Cj. We fix l Uk to (W - Xy=i C a f) and y ak to 
la k /C ak . For cdl j > kwe sety aj = 0 and l aj = 0. Constraints (1.3) are thus satisfied. Finally we fix 

la- 

Xi aj = -j f for cdl i,j so that constraints (1.2) and (1.1) are satisfied. This is a feasible solution of 
the linear relaxation of(l) achieving an objective value of Lb \. We have, therefore, Lbi > z) and 
consequently z\ = Lbi from Proposition 1. □ 


Adding the constraint xq < yj for each item i and bin j, strengthens the linear relaxation only if 
W < C Uk . Indeed, the solution given in the proof is otherwise feasible for the constraint, (V/ < 

k, x Ua] = 'ff < y a j = 1 and for j = k we have ^ ^ if W > C a f). 

Dominance and symmetries.. Formulation (1) can be used to solve the problem exactly and strength¬ 
ened to take some bin and item symmetries into account. Typically, if bins i and j are such that 
f < fj, d < Cj and C, > Cj then bin i is said to dominate bin j and the constraints y 7 - < y, as well 
as f < f can be added to rule out some dominated solutions. Suppose now that we have k items 
that have the same size, any permutation of the items in a feasible solution remains feasible and 
have the same overall cost. A simple way to break this symmetry is to replace for each bin j, the 
k variables x ik 6 {0,1} corresponding to the identical items by a variable x- 6 {(), ... ,k\ giving the 
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number of the k items assigned to j (and adding Tjj x ) = k). We also perform in practice another 
simple pre-processing by tightening the capacities of the bins using dynamic programming. Given 
the item sizes and for each bin z, one can compute efficiently (when C, is sufficiently small) the 
largest integer less than or equal to C, that is also equal to the sum of a subset of S. 

4. Two Extended Formulations of BPUC 

The Cutting Stock Model. The formulation of Gilmore and Gomory for the cutting stock prob¬ 
lem [9] can be adapted for BPUC. The items of equal size are now grouped and for n < n different 
sizes we denote the number of items of sizes w' v ..., w , by q \,..., qp respectively. A cutting pat¬ 
tern for bin j is a combination of item sizes that fits into bin j using no more than q d items of size 
w',. In the i-th pattern of bin j, the number of items of size w' d that are in the pattern is denoted g di j. 
Let Ij be the set of all patterns for bin j. The cost of the z-th pattern (assumed to be non empty) 

of bin j is therefore equal to co,j - /) + Q^ =1 gdijW d )Cj. The cutting stock formulation is using a 
variable p t j for the z-th pattern of bin j: 

Minimize z 2 = Z"Li Zze/, co uPu 
(2-1) Z" i Z«e/ ; gdijP,j = qd 

(2-2) Zi€ijPij = 1 

(2.3) Pij g {0,1} 

Constraint (2.1) states that each item has to appear in a pattern (thus in a bin) and (2.2) enforces 
one pattern to be designed for each bin (convexity constraints). A pattern p lf for bin j is valid if 

Y!d=\ gdijw' d < Cj and all g dlJ are integers such that q d > g dij > 0. The sets 7 ; have an exponential 
size so the linear relaxation of this model can be solved using column generation. The pricing step 
is a knapsack problem that can be solved efficiently by dynamic programming if the capacities 
are small enough. The pricing problem for bin j can be written as follows (where n d is the dual 
variable associated to the d-th constraint (2.1) and Aj is the dual variable related to j-th constraint 
( 2 . 2 )): 

Minimize f,y + (£ d=1 g d w' d )cj - Za =i n dgd ~ 

2rf=i gdW d <C jy 

g d 6 {0, l,...,q d } Vd 6 {1,..., zz} 

ye {0,1} 

In practice, we solve the pricing problem using Dynamic Programming in 0(nCj). 

The Arc-Flow Model. Carvalho introduced an elegant Arc-Flow model for BP [5, 6]. His model 
explicitly uses each unit of capacity of the bins. In the following we show how to adapt it for 
BPUC. Consider a multi-graph G( V. A), where V = jO, 1 ,..., C max \ U j F\ is the set of C max + 2 nodes 
labelled from 0 to C max and a final node labelled F. and A = I U J is the set of two kinds of edges. 
An edge ( a , b) 6 / between two nodes labelled a < C max and b < C max represents the use of an item 
of size b - a. An edge of (a, F) e J for each bin j represents the usage a of the bin j, and therefore 
a < Cj. An example of such a graph is shown in Figure 3(a). Notice that this formulation has 


Td 6 {1, . . . , Z7 } 

Vj G {1,..., zzz} 

Vj G m},i G Ij 
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Figure 2: (a) An example of the graph underlying the Arc-Flow model for S = {2,2,3,5), B = 
{{3,1,2), {4,3,1},{7,3,3}) so that C max — 7. A packing is shown using a dotted line: {3} is put in the first bin for 
a cost of 7, {2,2} is in the second bin for a cost of 7 and {5} in the last bin for a cost of 18. (b) The graph underlying 
the Arc-Flow model after the elimination of symmetries. 


symmetries since a packing can be encoded by many different paths. Some reduction rules were 
given by Carvalho [5], which help in reducing such symmetries (see Figure 3(b)). 

BPUC can be seen as a minimum cost flow between 0 and F with constraints enforcing the 
number of edges of a given length used by the flow to be equal to the number of items of the 
corresponding size. We have variables x a b for each edge (a, b) € / as well as variables y aj for each 
pair of bin j € {1,..., m } and a e V. The cost of using an edge ( a , F) e J for bin j with a > 0 is 
co a j = fj + a ■ Cj and coqj = 0. The model is as follows: 


Minimize 


v - ' m sr'k=C,„ l 
2-ij=\ 2jk=o 


(3.1) Yj(a,b)eA x ab D(b,c)eA x bc ybj ~ | _ 

(3.2) Z C a L 0 y aj = l Vje{l,.. 

(3.3) Ti(k,k+w d )eA x k,k+w ll = Qd VJg{1,2, 

(3.4) y aj = o' ‘ VO', a) g {1 

(3.5) x ah e M V(a,b) e A 

(3.6) y a j e {0,1} V(j,a) 6 { 


c °kjykj 

,,, _( 0 Vb € {1,2,..., C max } 

Lj=i ybj | _ m f or i, — q 

y j G {1,..., m) 

VJg {1,2 

V( ,/- u) G { 1, . . . , Uz} X {67) + 1, . . . , Cma X } 
'Fa, b) e A 

V0‘, a) e {1,... ,m} x {0,... ,C mflX ) 


Constraint (3.1) enforces the flow conservation at each node, and Constraint (3.2) states that each 
bin should be used exactly once. Constraint (3.3) ensures that all the items are packed, while 
Constraint (3.4) enforces that bin j is not used beyond its capacity Cj. A solution can be obtained 
again by decomposing the flow into paths. The number of variables in this model is in 0((n + m) ■ 
Cmax) and the number of constraints is 0(C max + m + n ). Although its LP relaxation is stronger 
than that of Model (1), it remains dominated by that of Model (2). 


Proposition 3. z\ < Z* T The optimal value of the linear relaxation of (3) is less than the optimal 
value of the linear relaxation of (2). 


Proof 3. Let (p*) be a solution of the linear relaxation of (2). Each pattern p*. is mapped to a 
path of the Arc-Flow model. A fractional value pL is added on the arcs corresponding to the item 
sizes of the pattern (the value of the empty patterns for which cdl gdij = 0 is put on the arcs yoj). 
The flow conserx’ation (3.1) is satisfied by construction, so is (3.2) because of (2.2) and so are the 
demand constraints (3.3) because of (2.1). Any solution of (2) is thus encoded as a solution of (3) 
for the same cost so z.) < z* T □ 
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Proposition 4. z* 2 can be stronger than zj i.e. there exist instances such that z 2 > z* r 

Proof 4. Consider the following instance: S = {1,1,2} and B = {{3,1,1},{3,4,4}}. Two items of 
size 1 occurs so that n = 2, q\ - 2, q 2 = 1 corresponding to w, = 1, w 2 = 2. 77ze two bins /love 
to be used and the first dominates the second (the maximum possible space is used in bin 1 in any 
optimal solution) so the optimal solution is the packing {{2,1}, {1}} (cost of 12). Let’s compute the 
value ofz* 2 - It must fill the first bin with the pattern [gm,g2ii] = (1, 11 for a cost of A. Only three 
possible patterns can be used to fill the second bin: [0,0], [1,0] and [2,0] (a valid pattern p i2 is 
such that gu 2 < 2). The best solution is using [gm, £212] = [2,0] and [g 122 ,gm] = [0,0] taking 
both a 0.5 value to get a total cost z* 2 = 4 + 6 = 10. The Arc-Flow model uses a path to encode the 
same first pattern [1,1 [for bin 1. But it can build a path for bin 2 with a | unit of flow taking three 
consecutive arcs of size 1 to reach a better cost of \ * 16 ~ 5,33. This path would be a pattern 
[3,0] which is not valid for (2). So Z3 ~ 9.33 and z* 2 > z* y □ 

The Arc-Flow model may use a path containing more than q d arcs of size w' d with a positive 
flow whereas no such patterns exist in (3) because the sub-problem is subject to the constraint 
0 < gdij < qd- The cutting stock formulation used in [5] ignores this constraint and therefore the 
bounds are claimed to be equivalent. 

5. Extending the Bin Packing Global Constraint 

A bin packing global constraint was introduced for constraint programming by [17] and discus¬ 
sions about its filtering can be found in [16, 14]. We present an extension of this global constraint 
to handle BPUC. The scope and parameters are as follows: 


BinPackingUsageCost([.vi, ..., x n ], [h,..., l m ], [y 1,... ,y m ], b, z,S,B ) 

Variables x t e [1,..., m], f e [0,..., C,] and b e {1,..., m] denote the bin assigned to item i, 
the load of bin j, and the number of bins used, respectively. These variables are also used by the 
BinPacking constraint. Variables y, e {0,1} and z e R are due to the cost. They denote whether bin 
j is open, and the cost of the packing. The last two arguments refers to BPUC and give the size of 
the items as well as the costs (fixed and unit). In the following, x (resp. x) denotes the lower (resp. 
upper) bound of variable x. 

Cost-based Propagation using Lb x . The characteristics of the bins of the restricted BPUC prob¬ 
lem based on the current state of the domains of the variables is denoted by B, and defined by 
B = PC,,,/|, C|},..., \C m , f ' m ,c m \\ where C'■ - lj - 1, is the remaining capacity, and f. is the re¬ 
maining fixed cost fj = (1 - y[)fj that is set to 0 if the bin is known to be open. The total load 
that remains to be allocated to the bins is denoted W ’ = W - W=i lj- Notice that we use the lower 
bounds of the loads rather than the already packed items. We assume it is strictly better due to the 
reasoning of the bin packing constraint. 
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Lower bound of z. The first propagation rule is the update of the lower bound of z denoted by z. 
The bound is computed by summing the cost due to open bins and minimum loads with the value 
of Lb\ on the remaining problem. It gives a maximum possible increase in cost denoted by gap-. 

Lb\ = Z'J = \(lj c j + Jjfj) + LbfW\ B')\ z. <- max( z, Lb\); gap = z-Lb\ (4) 

Bounds of the load variables. We use the notation L, to denote the units of space used by 
Zi>i on bin a,. The bins ai,...,a k -i are fully used so Vj < k,Lj = C' a , for bin a k we have 
L k = W' - Xy=! C' and V / > k, Lj = 0. We define the bin packing problem B obtained by 
excluding the space supporting the lower bound LbfW , B ). The resulting bins are defined as 
B" = {{C[, f[,c \),..., {C' m ,f m ,c m }} where C". = 0 for all j < k. C". = C',, - L k and C". = C' a . for 
all j > k. Lower and upper bounds of loads are adjusted with rules (5). 

Let q~. be the largest quantity that can be removed from a bin a n with j < k, and put at the 
other cheapest possible place without overloading z. Consequently, when j < k, q~. is the largest 
value in [0, Lj] such that (Lb\{q~.,B") - q~.r aj ) < gap. When j = k, the same reasoning can be 
done by setting = 0 in B . 

Similarly, let q+. be the largest value in [0, C' a ] that can be put on a bin aj, with j > k, without 
triggering a contradiction with the remaining gap of cost, qf is thus the largest value in [0, C' a ] 
such that (,q + a r aj - (LbfW',B' ) - LbfW' - q + a ., B'))) < gap. 

Lj<k, /^ + Lj - q~ aj \ Vy > k, l aj *- + q + af (5) 

Channelling. The constraint ensures two simple rules relating the load and open-close variables (a 
bin of zero load can be open): y 7 = 0 => lj = 0 and f > 0 => y ; = 1. 

Bounds of the open-close variables. The propagation rule for 1/ can derive lj > 0 from (5), which 
in turn (because of the channelling between y and /) will force a bin to open i.e. y a . e {0,1} will 
change to y a . = 1. To derive that ay ; has to be fixed to 0, we can use Lb\ similarly to the reasoning 
presented for the load variables (checking that the increase of cost for opening a bin remains within 
the gap). 

Tightening the bounds of the load variables can trigger the existing filtering rules of the bin 
packing global constraint thus forbidding or committing items to bins. Notice that items are only 
increasing the cost indirectly by increasing the loads of the bins because the cost model is defined 
by the state of the bins (rather than the items). The cost-based propagation on x is thus performed 
by the bin packing global constraint solely as a consequence of the updates on the bin related 
variables, i.e. I and y. 

Filtering the load variables with dynamic programming. Another filtering rule based on dynamic 
programming can be added to the Bin-Packing global constraint. We can simply update f (resp. 

lj) to the smallest (resp. largest) integer greater than (resp. less than) or equal to the current value 
of lj (resp. f) that can be reached with the remaining items that can still go on bin j. We can 
solve this problem by dynamic programming (taking into account the items already assigned to j). 
This is generally very costly in practice and is not performed by default by the bin-packing global 
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Figure 3: Visualisation of the lower bound Lbi and optimal solution for the instance of example 2. The bins have 
been sorted by increasing r. 

constraint. Nevertheless, in this case, one can notice that Lb\ strongly relies on the accuracy of 
lj and lj and we observed that it can help significantly in practice for some hard instances. This 
technique has been originally proposed by [19] for knapsack constraints. 


Example 2. Let’s consider the following instance: B ={(9,9,5), (3,1,5), (7,14,3), (5,1,10), (12,12,10)} 
and S = {3,5,5, 5}. The total load is therefore W = 18 and the values or rj sorted increasingly are 
r 3 = 14/7 + 3 = 5 < ri = 5.33 < /y = 6 < r^ = 10.2 < r$ = 11. The lower bound is therefore 
Lb\ = 7 x 5 + 3 x 5.33 + 8 x 6 = 99. Figure 3 is visualising Lbi and shows the optimal solution. 
Propagation on the bounds of the load variables by dynamic programming is not performed here 
for the illustrative purposes. 

Assuming that we have an upper bound z of value 130, the gap is therefore initially equal to 
130 - 99 = 31. Initially Lb} = Lb\ as W' = W and B' = B. Afterwards, the propagation is able to 
deduce h = l\ = 1 as well as f = 6 as illustrated on the left of Figure 4. Indeed / 3 = 0 would lead 
to a lower bound of 3 x5.33 + 9x6 + 5x 10.2 +1*11 = 132 thus overloading z„ Similarly f = 0 
would give Lb[ = 135 > 130 and with l 5 = 1 we get Lb[ = 134 > 130. At this stage we know that 
bins 3 and 1 are open in any solution of cost less than 130. This is affecting the propagation as 
the fixed cost of bins 1 and 3 are now included in Lb} and propagation is strengthened. The cost 
due to open bins is 9 + 14 = 23 and due to lower bounds of loads of bins 1 and 3 is 5 + 3 = 8. 
Consequently, B' = {(8,0,5), (3,1,5), (6,0,3), (5,1,10), (6,12,10)} and W' = 16. Therefore, the 
r values are now ranked: r 3 = 3 < = 5 < r^ = 5.33 < r 4 = 10.2 < r 5 = 12. Notice that the 

ordering on bins have changed. The new value ofLb[ is 6x3 + 8x5 + 2x 5.33 = 68.66 and thus 
the new value of Lb} is 99.66. When the fix point is reached we know that I 3 = l\ =3 as well as 

h = 3. 

Let’s now add the propagation using dynamic programming (7 3 would typically be reduce to 
5). The lower bound observed at the root node is 119.66 whereas the one obtained by linear 
programming i.e. with formulation (1)( including the initial tightening of the capacities by dynamic 
programming) is 114.4. 

Algorithms and Complexity. Assuming that B and W' are available, LbfW' ,B ) can be com- 
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Upper bound = 130 




r 3 = 5 r 2 =5.33 r\ = 6 =10.2 r 5 = 11 


r 3 = 3 r\ = 5 r 2 =5.33 r 4 =10.2 rs = 12 


Step 1 


Fix point 


Figure 4: Propagation of lower and upper bounds of the load variables. The state of the domains at the end of the first 
iteration is depicted on the left. The lower bounds of l\ and 1% are increased to 1 and the upper bound of I 5 is decreased 
to 6. The state of the domains when the fix point is reached is shown on the right. 

puted in 0(m log(m )) time. Firstly we compute the r ; values corresponding to B for all bins. 
Secondly, we sort the bins in non-decreasing rj. Finally, the bound is computed by iterating over 
the sorted bins and the complexity is dominated by the sorting step. After computing Lb\(W ,B ), 
the values aj (the permutation of the bins) such that r ai < r„ 2 < ... < r am are available as well as 
the critical k and L k = W' - £*“} C • The update of l Uj and l a . can then be done in O(m) as shown 
in Figure 5. Notice that this process can be repeated until a fix point is reached. 

Algorithm UpdateMinimumLoad of Figure 5 is used to update the lower bounds of the load 
variables for each bin cij such that r aj — r a k ■ Recall that k is the number of bins supporting the 
lower bound Lb 1 . If j < k (resp. j = k) then the algorithm tries to find the largest quantity, denoted 
by q~., that can be removed from the bin aj and can be put on the bins a.b, where b > k (resp. 
b > k + 1), such that the increment in the cost remains smaller than the gap (Lines 4-10). The 
algorithm loops over the bins as long as the q~ has not reached its maximum amount on bin a t 
i.e. Lj and the gap has not been overloaded. When the increment in cost exceeds the gap then the 
minimum load if known. Similarly, UpdateMinimumLoad of Figure 5 is used to update the upper 
bounds of the load variables for each bin aj such that r aj 4 r ak . 

5.1. Dominance and symmetries. 

The dominance and symmetries previously mentioned can be similarly eliminated here. We 
go a step further and eliminate some dominances during search. In particular, for two bins i, j 
that are known to be open (y, = y ; = 1) such that c, < cj and C, > C n we can enforce /, > lj 
in the remaining sub-tree. Regarding item symmetries, we prefer to avoid changing the domain’s 
definitions (as done for Formulation (1)) to keep the semantics of the global constraint unchanged. 
Instead, each time a bin is proved to be infeasible for an item, the same value is removed from all 
the ungrounded items of the same size. 

5.2. Search. 

The propagation scheme relies on the knowledge of good upper bounds so the search process 
is driven by this need. We branch first on the bin variables y before the item’s variables x. We 
branch on the cheapest bins first by selecting the y with minimum r value and setting it to 1. The y 
variables are therefore sorted lexicographically by non-decreasing rj values. Once all y variables 
are grounded, we select the open bin k with the smallest slope (cy) and assign to it the largest item 
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Algorithm 1: UpdateMinimumLoad 


Input: 


with j< ft, B , gap 


Output: a lower bound of l a 


1. 

2 . 

3. 

4. 

5. 

6. 

7. 

8. 

9. 

1®. 

11 . 


costInc 0; q a 


0; ft <— ft; 


If (j = ft) [b <— ft + 1;} 

While Cq~. < Lj && b < m ) 

load Add <— min(L ; - - q~ , C ab - Lb ); 
costlncb load Add X (r ab - r a ); 

If ((costlncb + costlnc ) > gap) 

, | gap-costlnc \ m 
+ L r „. -r„ -I » 


da, daj 


' a b 


return l a 

costlnc 


+ Li ■ 


daj ’ 


_J_ ■ -V 

costlnc + costlncb ; 


daj 

return l a 


da + load Add ; ft = b + 1; 


Algorithm 2: UpdateMaximumLoad 
Input: aj with / > /c , j9 , gap 
Output: an upper bound of l a 

I. costlnc 0; q^. 0; ft k ; 

2 . If (j = ft) {q+. L k ; ft <- ft - 1;} 

3 . While < CO && ft > 0) 

4. loadAdd <— min{Lb, C a - q +.); 

5 . costlncb <— loadAdd X (r a . - r ah ); 

6. If ((costlncb + costlnc ) > gap) 

7 «_ a + + I gap-costlnc , . 

7 ' d aj da, + L raj ._ ro4 J - 

8. return l aj + q „.; 

9. costlnc <— costlnc + costlncb ; 

1®. q+ <— q+ + loadAdd ; ft = ft - 1; 

II. return Z a 


Figure 5: Propagation algorithms for updating the lower and upper bounds of the load variables 

that can fit in a perfect packing of the bin. We thus check by dynamic programming that the item 
selected belongs to a subset of items that perfectly (up to its capacity) fill the bin. The search is 
binary, i.e. on the left branch we enforce x, = k and on the right branch we simply have x, A ft. 

5.3. Propagating a stronger lower bound. 

The lower bound z * 2 obtained by solving the linear relaxation of the cutting stock model can 
also be propagated by the constraint. However current restrictions of the domains have to be 
taken into account when computing the bound during search. The master and pricing problem 
are affected as we apply the bound by taking into account items already assigned (the capacity of 
the bins is reduced accordingly), and bins known to be open/close and currently possible items 
for each bin. The bound thus benefits from all other reasoning that acts on the domains. Since 
the pricing problem is time-consuming we found that the following techniques are important for 
efficiency: 

• We keep the columns of the previous call to the cutting stock model that are still compatible 
(feasible) with respect to the current domains. 

• When solving the pricing problem for a bin, an upper bound is first computed by sorting the 
items from the most beneficial to the least and adding them greedily as long as the capacity 
is not overloaded. If the column obtained has a negative reduced cost, the dynamic program¬ 
ming algorithm is not called and the algorithm moves on. This is a very common technique 
in column generation that pays off when the pricing is costly and the master very easy to 
solve (because it usually increases the number of iterations). The dynamic programming 
algorithm is called at least once to prove that no negative reduced cost columns still exist 
and ensure the validity of the bound. 
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Table 1: Comparison of the results obtained using MIP. Arc-Flow, and CP approaches approaches on random bin 
packing with usage cost problem instances with 600 seconds time-limit. 


n 

m 

X 

MIP 

CP 

CP+CG 

Arc-Flow 

CG 

n 

m 

X 

gap Z\ 

#ns 

cpu 

nodes 

#ns 

cpu 

nodes 

#ns 

cpu 

nodes 

gap 2 ^ 

#ns 

cpu 

nodes 

gap 4 

15 

10 

1 

4.81 

10 

1.2 

2112 

10 

0.3 

1270 

10 

0.3 

1109 

4.53 

10 

2.1 

2402 

4.47 

15 

10 

2 

2.91 

10 

1.1 

2050 

10 

0.1 

1020 

10 

0.1 

785 

1.81 

10 

0.7 

1508 

1.77 

15 

10 

3 

3.31 

10 

0.8 

1847 

10 

0.4 

4372 

10 

0.2 

972 

1.36 

10 

0.6 

420 

1.33 

25 

15 

1 

1.76 

10 

35.1 

29546 

10 

20.1 

140533 

10 

1.2 

1410 

1.60 

9 

47.5 

891 

1.59 

25 

15 

2 

1.91 

10 

74.2 

114068 

10 

10.1 

78297 

10 

0.8 

1498 

1.53 

10 

61.2 

12557 

1.51 

25 

15 

3 

2.60 

10 

22.6 

46350 

8 

26.9 

198792 

10 

7.3 

3359 

0.93 

10 

11.3 

1039 

0.93 

25 

25 

1 

1.67 

9 

19.0 

10876 

8 

14.5 

99250 

10 

1.3 

1394 

1.42 

9 

31.2 

4582 

1.42 

25 

25 

2 

1.47 

9 

97.4 

114530 

9 

17.6 

117327 

10 

2.1 

1706 

1.08 

10 

54.8 

5967 

1.07 

25 

25 

3 

2.30 

10 

33.7 

53462 

8 

109.5 

537144 

10 

2.2 

1816 

0.93 

10 

20.6 

510 

0.92 

200 

10 

1 

2.73 

10 

1.9 

1011 

10 

6.4 

14626 

10 

13.5 

14626 

2.73 

0 

- 

- 

2.73 

200 

10 

2 

2.16 

10 

18.9 

68121 

10 

2.9 

6641 

10 

10.3 

6641 

2.16 

0 

- 

- 

2.16 

200 

10 

3 

1.82 

10 

16.2 

93306 

10 

0.4 

4064 

10 

1.7 

4064 

1.82 

0 

- 

- 

1.82 

250 

15 

1 

1.53 

10 

3.9 

1627 

10 

7.4 

11159 

10 

11.7 

11159 

1.53 

0 

- 

- 

1.53 

250 

15 

2 

1.30 

9 

26.3 

50275 

10 

6.1 

12902 

10 

20.4 

12902 

7.21 

0 

- 

- 

1.30 

250 

15 

3 

0.96 

4 

190.9 

546599 

10 

4.8 

14111 

10 

23.7 

14111 

0.96 

0 

- 

- 

0.96 

500 

30 

1 

0.62 

10 

18.1 

7449 

10 

30.6 

14829 

10 

77.9 

14829 

0.62 

0 

- 

- 

0.62 

500 

30 

2 

0.46 

7 

173.2 

193274 

10 

16.7 

14340 

10 

78.4 

14340 

0.46 

0 

- 

- 

0.46 

500 

30 

3 

0.28 

1 

184.6 

322806 

10 

24.2 

21285 

9 

187.9 

19678 

0.28 

0 

- 

- 

0.28 


6. Experimental Results 

In this section we report the results for solving bin packing with usage cost problem to show 
the efficiency of the BPUC global constraint. 

We compare on randomly generated instances the lower bounds z* v z* 2 ,z* 3 as well as exact al¬ 
gorithms: Model (1), Arc-Flow Model (3), and two CP models using the BinPackingUsageCost 
constraint. The second CP model referred to as CP+CG activates the propagation of the z* bound 
by the BinPackingUsageCost constraint. Standard symmetry/dominance breaking techniques for 
BP are applied to the MIP [15] of Model (1) and CP [17]. A random instance is defined by 
(. n , m, X ), where n is the number of items (n e [15,25,200,250,500}), m is the number of bins (in e 
{10,15,25,30}), and parameter X e {1,2,3} denotes that the item sizes are uniformly randomly 
generated in the intervals [1,100], [20,100], and [50,100] respectively. The capacities of the bins 
are picked randomly from the sets [80, 100, 120, 150, 200, 250} and [800,1000,1200,1500, 2000, 
2500} when n e {15,25} and n e [200,250,500} respectively. The fixed cost of each bin is set to 
its capacity and the unit cost is randomly picked from the interval [0,1]. For each combination of 
(ft, m) e {(15,10), (25,15), (25,25), (200,10), (250,15), (500,30)} and A 6 {1,2,3} we generated 
10 instances giving 180 instances in total. 

The time-limit was 600 seconds. All the experiments were carried out on a Dual Quad Core 
Xeon CPU, running Linux 2.6.25 x64, with 11.76 GB of RAM, and 2.66 GHz processor speed. 
The LP solver used was CPLEX 12.5 (default parameters) and the CP solver was Choco 2.1.5. 
Table 1 reports the average cpu time in seconds (denoted cpu), the average number of nodes 
in the search tree (denoted nodes) and the average gap of the lower-bounds found at root node 
(denoted gapz * x ). This gap is computed as a percentage of the best known solution found. Column 
#ns gives the number of instances solved to optimality (i.e optimal value was found and proved 
optimal) within the time limit. 

The CP approach shows better performance when scaling to larger size instances (and capac- 
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ities) than the MIP and Arc-Flow models. Overall, the Arc-Flow model fails to solve optimally 
92 instances of the 180 instances, MIP fails on 21, CP fails on 7 and CP+CG only on 1. 

On one side, the search space is dramatically reduced by the propagation of z* 2 (CP+CG) for 
small problems. On the other side, the bound becomes ineffective on large problems (this can also 
be seen on the gap at the root node). 


7. Conclusion and Future Work 

Bin Packing with Usage Costs (BPUC) problem can be viewed as a core subproblem of many 
optimisation problems related with workload consolidation in data centres. The main contribu¬ 
tion of this paper is the study of various lower bounds and exact formulation for BPUC. Firstly, 
the value of the linear relaxation of a basic LP model for BPUC can be easily computed. Sec¬ 
ondly, we show that this bound can be strengthened with filtering algorithms reasoning on the 
minimum/maximum load of each bin and that the resulting CP approach can efficiently handle 
relatively large instances. 
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